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(57) Abstract 

An apparatus (10) is provided for classifying information or content servers on a communications network including the Internet The 
apparatus (10) comprises means for obtaining one or more transmission characteristics of information on a path of said communications 
network and analysing means for predicting a classification of said information based on said one or more transmission characteristics. 
Typically said one or more transmission characteristics include any one or more of network protocol, date and time stamps, size of 
transmission activities (text and image), content type of transmission activities, pattern seen within the content of the transmission and 
any other characteristic that can be employed for predicting classifications. The apparatus (10) can be adapted toclassify user profiles in 
accordance with the predicted classification. A knowledge base of predetermined profiles can be included, and the analysing means is 
adapted to predict a classification based on a comparison between the profile of information to be classified and the predetermined profiles. 
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APPARATUS AND SYSTEM FOR CLASSIFYING AND 
CONTROL ACCESS TO INFORMATION 

TECHNICAL FIELD OF THE INVENTION 
THIS INVENTION relates to apparatus and system for classifying information 
on communications network and in particular but not limited to apparatus and 
system for classifying content servers and for selectively controlling access to 
classified content servers. 

BACKGROUND OF THE INVENTION 
The phenomenon growth of information technology has allowed many 
people to have access to diverse information on communications networks'. The 
Internet in particular allows fetching of information from any cooperating 
computers or content servers located in different parts of the world by simply 
clicking references to the information. As the number of accessible computers or 
content servers and the amount of information over the communications network 
grow daily it becomes increasingly difficult to classify them manually. 

Known systems for controlling the types of information accessible on a 
network rely on comparing a requested destination with those on pre-determined 
Access Control Lists (ACL) or on word matching to determine whether to allow or 
deny access. This approach can be applied at the client node prior to requesting the 
information or on any suitably intelligent network device capable of intercepting 
the request or subsequent reply prior to it reaching the requester. For example, in 
the case of an Internet browser running on a PC or work station, a request is made 
for an Internet resource such as a web site. A software program for monitoring such 
requests on the PC can be configured to scan a pre-determined list of site addresses 
for a match. If found, access to the site may be denied and a suitable message is 
then displayed informing the user that access is denied. Alternatively, the request 
may be allowed to proceed, but as data are received from the site they are scanned 
for checking a match with one or more sets of pre-determined words, word 
fragments or phrases. If a match is found the site is not displayed on the computer 
but instead there is shown a suitable message. Typically, this type of control 
software is installed on a PC or work station which does not have particularly strict 
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access privileges. The control software can be easily removed, disabled or 
otherwise circumvented and thereby defeating the control system. 

A network device capable of intercepting the request or reply to a request, 
such as a proxy server, may perform similar actions using the same methods of web . 
5 site matching. This is usually maintained by a network administrator with strict 
access rights. Also, a network requiring clients to connect through the network 
device in order to access the network can have its content control enforced. This 
allows content control of multiple clients from one central point. 

While these known systems do provide some access control abilities, there 
10 are several disadvantages. A system based on word or phrase matching carf only 
match text and it therefore would allow access to undesired information comprising 
graphic images. Also, a single word may match a broad range of sites with quite 
different classes of information. As an example, when the word "sex" is used to 
match pornographic sites the system would also block access to other sites 
15 providing non offensive information such as articles on biology. 

A system based on an access control list of prohibited sites is much more 
selective. Access can only be denied when attempting to access the sites which are 
included in the lists. While a suitably large list could bar access to a great deal of 
undesirable information it is difficult to keep up to date due to the rapid increase 
20 in the number of new sites and removal of sites. 

The above systems also do not lend themselves to adaptation to other 
network protocols and services such as interactive chat, streaming video, email or 
encrypted data streams. Extending to different languages also poses a problem for 
globalisation of these systems. 
25 OB1ECT OF THE INVENTION 

An object of the present invention is to alleviate or to reduce to a certain 
degree one or more of the above disadvantages. 

Another object of the present invention is provide an apparatus/system for 
classifying user profiles. 

30 
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SUMMARY OF THE INVENTION 
In one aspect therefor the present invention resides in an apparatus for 
classifying information on communications network. The apparatus comprises 
means for obtaining one or more transmission characteristics of information on a 
5 path of said communications network, and analysing means for predicting a 
classification of said information based on said one or more transmission 
characteristics. 

In a second aspect therefor the present invention resides in an apparatus for 
classifying content servers which are accessible on a communications network. The 

1 0 apparatus comprises means for obtaining one or more transmission characteristics 
of information provided by any of said content servers on a path of said 
communications network, and analysing means for predicting a classification of said 
information based on said one or more transmission characteristics. 

In a third aspect therefor the present invention resides in a computer 

15 program for classifying information which is accessible on a communications 
network. The program comprises means for obtaining one or more transmission 
characteristics of information on a path of said communications network, and 
analysing means for predicting a classification of said information based on said one 
or more transmission characteristics. 

20 In a fourth aspect therefor the present invention resides in a computer 

program for classifying content servers which are accessible on a communications 
network. The apparatus comprises means for obtaining one or more transmission 
characteristics of information provided by any of said content servers on a path of 
said communications network, analysing means for predicting a classification of 

25 said information based on said one or more transmission characteristics. 

In a fifth aspect therefor the present invention resides in an 
apparatus/computer program for classifying user profiles of users accessing 
information or content servers on a communications network. The 
apparatus/computer program comprises means for obtaining one or more 

30 transmission characteristics of information or information provided by any one of 
said content servers on a path of said communications network, analysing means 
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for predicting a classification of said information or said one content server based 
on said one or more transmission characteristics, and means for classifying user 
profile in accordance with the predicted classification. 

The above invention may also comprise means for storing said one or more 
5 transmission characteristics. 

■ 

Typically said one or more transmission characteristics include any one or 
more of network protocol, date and time stamps, size of transmission activities (text 
and image), content type of transmission activities, pattern seen within the content 
of the transmission and any other characteristic that can be employed for predicting 
10 classifications. ^ 

In preference said one or more transmission characteristics are obtained from 
network packets or fragments thereof. 

It is also preferred that the analysing means includes profiling means for 
providing profiles of interactions based on said one or more transmission 
15 characteristics. Typically said profiling means is arranged to process said one or 
more transmission characteristics for providing any one or more of frequency of 
interaction, duration of interaction, duration of absence of interaction, patterns of 
transmission, average number of http links within an object of related sites, average 
number of like sites visited within a time frame, and statistics from said other 
20 characteristics, for forming interaction profiles. The analysing means can then use 
the profiles for predicting classifications. 

The invention may have a knowledge base of predetermined profiles, and 
the analysing means is adapted to predict a classification based on a comparison 
between the profile of information to be classified and predetermined profiles. 
25 Advantageously the invention may have means for updating the knowledge 

base so that the classification prediction may be enhanced following classifications. 

In order that the present invention can be more readily understood and be 
put into practical effect reference will now be made to the accompanying drawings 
which illustrate one preferred embodiment of the invention and wherein: 
30 BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 is a schematic diagram of the apparatus according to the invention; 
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Figure 2 is a table of selected data of captured packets of a search engine 
using the apparatus shown in Figure 1; 

Figure 3 is a partial table of selected data of captured packets of a news web 
site using the apparatus shown in Figure 1; 
5 Figure 4 is a table of selected data of captured packets of an entertainment 

web site using the apparatus shown in Figure 1 ; 

Figure 5 is a table of selected data of captured packets of the web site of an 
e-commerce merchant using the apparatus shown in Figure 1; 

Figure 6 is a table of selected data of captured packets of the web site of 
10 another e-commerce merchant using the apparatus shown in Figure 1; 

Figure 7 is a table of selected data of captured packets of a pornography web 
site using the apparatus shown in Figure 1; 

Figure 8 is a table of selected data of captured packets of another 
pornography web site using the apparatus shown in Figure 1; 
1 5 Figure 9 is a table of model N 1 results using the apparatus shown in Figure 

i; 

Figure 10 is a table of model N2 results using the apparatus shown in Figure 

i; 

Figure 1 1 is a table of model N3 results using the apparatus shown in Figure 

20 1 ; and 

Figure 12 is a table of classification prediction confidence levels using the 
apparatus shown in Figure 1 . 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
Referring initially to Figure 1 there is shown an apparatus 10 for classifying 
25 media or information flowing through a path of a communications network which 
in this case is the Internet. 

As can be seen, network traffic passing through the apparatus 1 0 is captured 
and analysed for providing statistics relating to interactions between two or .more 
terminals (not shown). The captured traffic is first checked against a list of 
30 predetermined classifications to determine if it is known or unknown. 
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When the captured traffic is of an unknown classification, various models 
(to be described more fully below) are applied to the data set in the captured traffic 
in order to predict the content classification. The models use parameters derived 
from a knowledge base of previously classified data sets and fitness with these . 
5 parameters to determine the classification of the content of the newly captured 
traffic. Thus, the web site sending the captured traffic is now classified and is added 
to the list of known classifications. 

It should be noted that the embodiment of the apparatus 10 as described 
herein is for an analysis of transmission traffic using the HTTP protocol. The 
10 apparatus 10 according to the present invention is not restricted to HTTP, and 

is easily adaptable to analyse data carried within any networks using any known 
protocol. Examples of the protocols include FTP, SMTP, NNTP, etc. 

Following classification the captured data set is stored in the knowledge 
base. As the knowledge base expands, more data are used for the model 
15 parameters. This refines the apparatus and results in improved predictive 
performance. 

The sites that are deemed to include undesirable information are added to 
Access control lists (ACLs). The ACLs are used control the flow of content 
information between terminals. E.g. Undesired content information can be 
20 prevented from travelling further through the network by simply not forwarding it, 
or by replacing it, or by intercepting the request for such content information and 
modifying its destination. 

Classification of traffic from content servers are relatively static. On the other 
hand, user terminals that interact with these content servers are variable and their 
25 classifications are considered transient classifications. 

Whereas classifications of content servers form a model of the style of 
content residing on the server, transient classifications form a model of style of 
content being viewed by a user terminal, or content consumer. This in effect forms 
a behaviour profile of such a consumer. This profile can be used to tailor the 
30 content information to suit the consumer. 
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As mentioned earlier the apparatus 10 captures a set of observed data 
relating to a network interaction event, and provides a set of results indicating the 
classification of a resource or personality residing at each network node involved 
in the interaction. This is accomplished by applying various statistical models to a 
5 profile, and testing this against results obtained from profiles of known 
classifications. In this example of the invention this process is represented by the 
following formulas: 

x is an unknown profile to be classified; 

Profiles p1,p2,p3...pn are of known classifications; 
10 Models M1,M2,M3...Mn are available to operate on these profiles; and ' 

C1,C2,C3...Cn are profile classifications. 

The population of a profile of classification CI, may be defined by the 

population of Ml (p). Ml (x) may be tested against the true population using any of 

the standard statistical hypothesis methods. 
1 5 A predetermined set of media terminals of a classification are modelled by 

various models Ml, M2 Mn. Each model consists of an approach and a set of 

parameter, e.g linear regression, gradient and point of interception, so that for a 

single classification M1(pl,p2 .. pn), M2(q1,q2 .. qn) .. Mn(rl,r2 rn) are used to 

model the population from the classification. The models may be based on 
20 mathematical structures, or arbitrary rules. 

The models are continually refined as more network traffic passes through 

the apparatus 10, thereby increasing the population space from which the 

classifications are computed. 

A terminal may be permanently or transitionally defined in relation to a 
25 classification. A transitionally defined terminal may move between classifications 

based on the fitness of the observed traffic to the models of the various 

classifications. 

Figures 2 to 8 are tables of selected data of traffic for testing the profile of 
data during a network interaction with a content server to determine if it contains 
30 media content of a pornographic nature. Assumption is made that profiles for 
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content servers contain a variable which is the average size of graphical images 
served. 

A normal distribution or similar non-deterministic probability distribution is 
then used to test the hypothesis that the profile belongs to a population classified* 
5 as pornographic. In this example, the population of the classification may be 
defined by the population of N(a,b) where N is the image size and a and b are the 
mean and variance respectively, based on a normal distribution. The average and 
standard deviation derived from the observed samples is tested against the true 
population using standard statistical hypothesis methods. 
10 In some cases this approach may be broadened to encompass analysis of 

variance methods with multiple dependant variables, to model the characteristics 
of a site. Traditional ANOVA or regressive techniques may be applied to model the 
media content. 

A variety of traditional deterministic and non-deterministic models may be 
1 5 applied to determine the hypothesis of profile classification. These may be changed 
or upgraded continually depending on the level of predictive power found. The 
functionality of models used is not limited to, but can include simple rules-of- 
thumb, deterministic and non-deterministic probability models, or arbitrary 
calculations. 

20 The choice of model is primarily dictated by the predictive power of that 

model against the population in question. 

Figures 2 through 8 show examples of basic data set that can be gathered by 
observing network traffic of a typical interaction between a client browser and a 
web server. 

25 Figures 9 to 1 1 illustrate a simple classification model. This model looks at 

the size, content and relationships of objects being transmitted by a content server. 

The outcome of this model is to determine if the media being transmitted has 

pornographic content. 

Classification: pornographic 
30 Standard Model: 

N1(a,b) 



E.VS30CID: <WO. 



C352533A1J_> 



1 



WO 00/52598 PCT/AUO 0/00 158 

9 

Where N1 is the image size, a and b are the mean and variance respectively, based 

on a normal distribution. 

N2(c,d) 

Where N2 is the ratio of text to graphics, c and d are the total size of the text and 
5 graphic objects respectively. 
N3(e) 

Where N3 is the count of word patterns matched from a list of pre-determined 
words, and e is the text of an object. 

Observed Samples are given in the tables shown in Figures 2 to 8. 
10 For model N1 shown in Figure 9, there is applied the normal distribution 

hypothesis test to the observed samples deriving the results. 

The result shows confidence to the 93% and 87% level for sites 6 and 7 
respectively, that the sites belong to a population of pornographic sites. The other 
samples give much lower confidence levels. 
1 5 For model N2 shown in Figure 1 0, a simple rule is used to test if the ratio is 

below a pre-determined threshold. The results show that sites 2, 4, 6 and 7 are 
within the threshold rating. 

For Model N3 shown in Figure 11, a simple rule is used to test if the 
number of words matching a list of patterns, exceeds a pre-determined threshold. 
20 The results show that sites 6 and 7 exceed the threshold. 

A weighting formula is then applied to derive a final result as shown in 
Figure 12. 

Therefore, using this example model, the apparatus 10 would predict that 
sites 6 and 7 are probably serving media with pornographic content, whereas sites 
25 1 through 5 probably are not. 

The attached appendix shows an example of the set of rules, constants and 
formulas which determine a confidence prediction based on logistic regression. The 
rules are defined using "Submodel" and "Model" components to define individial 
data points, and aggregated data points. These are then referred to in the 
30 "ProbabilityAnalyser" equations which use standard predictive formulas. 
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Whilst the above has been given by way of illustrative example of the 
present invention many variations and modifications thereto will be apparent to 
those skilled in the art without departing from the broad ambit and scope of the 
invention as herein set forth. 
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CLAIMS 

1 . An apparatus for classifying information on communications network, the 
apparatus comprises means for obtaining one or more transmission characteristics 
of information on a path of said communications network, and analysing means for 
predicting a classification of said information based on said one or more 
transmission characteristics. 

2. An apparatus for classifying content servers which are accessible on a 
communications network, the apparatus comprises means for obtaining one or 
more transmission characteristics of information provided by any of said content 
servers on a path of said communications network, and analysing means for 
predicting a classification of said information based on said one or more 
transmission characteristics. 

3. A computer program for classifying information which is accessible on a 
communications network, the program comprises means for obtaining one or more 
transmission characteristics of information on a path of said communications 
network, and analysing means for predicting a classification of said information 
based on said one or more transmission characteristics. 

4. A computer program for classifying content servers which are accessible on 
a communications network, the apparatus comprises means for obtaining one or 
more transmission characteristics of information provided by any of said content 
servers on a path of said communications network, analysing means for predicting 
a classification of said information based on said one or more transmission 
characteristics. 

5. An apparatus for classifying user profiles of users accessing information or 
content servers on a communications network, the apparatus comprises means for 
obtaining one or more transmission characteristics of information or information 
provided by any one of said content servers on a path of said communications 
network, analysing means for predicting a classification of said information or said 
one content server based on said one or more transmission characteristics, and 
means for classifying user profile in accordance with the predicted classification. 
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6. A computer program for classifying user profiles of users accessing 
information or content servers on a communications network, the program 
comprises means for obtaining one or more transmission characteristics of 
information or information provided by any one of said content servers on a path- 
of said communications network, analysing means for predicting a classification of 
said information or said one content server based on said one or more transmission 
characteristics, and means for classifying user profile in accordance with the 
predicted classification. 

7. The invention according to any one of claims 1 to 6 further comprising 
means for storing said one or more transmission characteristics. ^ 

8. The invention according to any one of claims 1 to 7 wherein said one or 
more transmission characteristics include any one or more of network protocol, 
date and time stamps, size of transmission activities (text and image), content type 
of transmission activities, pattern seen within the content of the transmission and 
any other characteristic that can be employed for predicting classifications. 

9. The invention according to any one of claims 1 to 8 wherein said one or 
more transmission characteristics are obtained from network packets or fragments 
thereof. 

10. The invention according to any one of claims 1 to 9 wherein the analysing 
means includes profiling means for providing profiles of interactions based on said 
one or more transmission characteristics. 

11. The invention according to claim 10 said profiling means is arranged to 
process said one or more transmission characteristics for providing any one or more 
of frequency of interaction, duration of interaction, duration of absence of 
interaction, patterns of transmission, average number of http links within an object 
of related sites, average number of like sites visited within a time frame, and 
statistics from said other characteristics, for forming interaction profiles, and the 
analysing means is adapted to use the profiles for predicting classifications. 

1 2. The invention according to any one of claims 1 to 1 1 further comprising a 
knowledge base of predetermined profiles, and the analysing means is adapted to 
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predict a classification based on a comparison between the profile of information 
to be classified and predetermined profiles. 

1 3. The invention according to claim 1 2 further comprising means for updating 
the knowledge base so that the classification prediction can be enhanced following 
classifications. 
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Appendix 

# 

#Body Text Word Ratio Models 
5 SubModel Param AllWordCount WordList All Words 
SubModel Param AllWordCount Context BODY 



#Body Text Unique Word Ratio Models 
SubModel Param AllWordCountUnique WordList AllWords 
10 SubModel Param AllWordCountUnique Context BODY 

# 

#Meta Text Word Ratio Models 

SubModel Param AllMetaWordCount WordList AllWords 
1 5 SubModel Param AllMetaWordCount Context META 

#- 

#Alternate Text Word Ratio Models 

SubModel Param All Alternate WordCount WordList AllWords 
SubModel Param AllAlternateWordCount Context ALTERNATE 

20 #- — 

#lmage models 

SubModel Param LargeGIFPictureCount Dimension 201 x 201 - 999 x 999 

SubModel Param LargeGIFPictureCount ImageType GIF 

SubModel Param ThumbnailGIFPictureCount Dimension 51 x 51 - 200 x 200 
25 SubModel Param ThumbnailGIFPictureCount ImageType GIF 

SubModel Param IconGIFPictureCount Dimension 5 x 5 - 50 x 50 

SubModel Param IconGIFPictureCount ImageType GIF 

SubModel Param AllGIFPictureCount ImageType GIF 

Model Exp LargeGIFPictureRatio RATIO(LargeGIFPictureCount, 
30 AllGIFPictureCount) 
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Model Exp ThumbnailGIFPictureRatio RATIOfThumbnailGIFPictureCount, 
AllCIFPictureCount) 

Model Exp IconGIFPictureRatio RATIO(lconGIFPictureCount, 
5 AllGIFPictureCount) 
# 

SubModel Param LargeJPEGPictureCount Dimension 201 x 201 - 999 x 999 

SubModel Param LargeJPEGPictureCount ImageType JPEG 

SubModel Param ThumbnailJPEGPictureCount Dimension 51 x51 - 200 x 200 

10 SubModel Param ThumbnailJPEGPictureCount ImageType JPEG 

SubModel Param IconJPEGPictureCount Dimension 5 x 5 - 50 x 50 
SubModel Param IconJPEGPictureCount ImageType JPEG 
SubModel Param AllJPEGPictureCount ImageType JPEG 
Model Exp LargeJPEGPictureRatio RATIO(LargeJPEGPictureCount, 

15 AllJPEGPictureCount) 

Model Exp ThumbnailJPEGPictureRatio RATIO(ThumbnailJPEGPictureCount, 
AllJPEGPictureCount) 

Model Exp IconJPEGPictureRatio RATIO(lconJPEGPictureCount, 
AllJPEGPictureCount) 

20 # 

SubModel Param LowDepthGIFPictureCount Depth 2-4 
SubModel Param LowDepthGIFPictureCount ImageType GIF 
SubModel Param MediumDepthGIFPictureCount Depth 5-6 
SubModel Param MediumDepthGIFPictureCount ImageType GIF 

25 SubModel Param HighDepthGIFPictureCount Depth 7-16 
SubModel Param HighDepthGIFPictureCount ImageType GIF 
Model Exp LowDepthGIFPictureRatio RATIO(LowDepthGIFPictureCount, 
AllGIFPictureCount) 

* 

Model Exp MediumDepthGIFPictureRatio 
30 RATIOCMediumDepthGIFPictureCount, AllGIFPictureCount) 
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Model Exp HighDepthGIFPictureRatio RATIO(HighDepthGIFPictureCount, 

AllCIFPictureCount) 

# 

5 SubModel Param LowDepthJPEGPictureCount Depth 2-7 

SubModel Param LowDepthJPEGPictureCount ImageType JPEG 
SubModel Param MediumDepthJPEGPictureCount Depth 8-15 
SubModel Param MediumDepthJPEGPictureCount ImageType JPEG 
SubModel Param HighDepthJPEGPictureCount Depth 16-36 
10 SubModel Param HighDepthJPEGPictureCount ImageType JPEG ' 
Model Exp LowDepthJPEGPictureRatio RATIO(LowDepthJPEGPictureCount, 
AllJPEGPictureCount) 

Model Exp MediumDepthJPEGPictureRatio 
RATIO(MediumDepthJPEGPictureCount, AllJPEGPictureCount) 
15 Model Exp HighDepthJPEGPictureRatio RATIO(HighDepthJPEGPictureCount, 
AllJPEGPictureCount) 

#_ 

# Links Out Models 

SubModel Param AllLinkOutCount IncludeLocal FALSE 
20 # 

SubModel Param AVSLinkOutCount Classification ADULTVERIFICATION 

SubModel Param AVSLinkOutCount IncludeLocal FALSE 

Model Exp AVSLinkOutRatio RATIO(AVSLinkOutCount, AllLinkOutCount) 

25 # begin porn.conf 

#Body Text Word Count Models 

SubModel Param PornExtraHardWordCount WordFile 

models/d i cti onary/porn/porn_words_extrahard .txt 

SubModel Param PornHardWordCount WordFile 
30 models/dictionary/porn/porn_words_hard.txt 
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SubModel Param PornMediumWordCount WordFile 
models/dictionary/porn/porn_words_medium.txt 
SubModel Param PornLiteWordCount WordFile 
5 models/dictionary/porn/porn_words_lite.txt 

m 

SubModel Param PornExtraLiteWordCount WordFile 

models/dictionary/porn/porn_words_extralite.txt 

# — 

#Unique Body Text Word Count Models 

10 

SubModel Param PornExtraHardWordCountUnique WordFile 

models/dictionary/porn/porn^words^extrahard.txt 

SubModel Param PornExtraHardWordCountUnique Mode Distinct 

SubModel Param PornHardWordCountUnique WordFile 

1 5 models/dictionary/porn/porn_words_hard.txt 

SubModel Param PornHardWordCountUnique Mode Distinct 

SubModel Param PornMediumWordCountUnique WordFile 

models/dictionary/porn/porn_words__medium.txt 

SubModel Param PornMediumWordCountUnique Mode Distinct 

20 SubModel Param PornLiteWordCountUnique WordFile 
models/dictionary/porn/porn_wordsJite.txt 
SubModel Param PornLiteWordCountUnique Mode Distinct 
SubModel Param PornExtraLiteWordCountUnique WordFile 
models/dictionary/porn/porn_words - extralite.txt 

25 SubModel Param PornExtraLiteWordCountUnique Mode Distinct 
# 

#Body Text Word Ratio Models 

Model Exp PornTextWordRatioExtraHard RATICKPornExtraHardWordCount, 
AllWordCount) 

30 Model Exp PornTextWordRatioHard RATIO(PornHardWordCount, 
AllWordCount) 
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Model Exp PomTextWordRatioMedium RATIO(PornMediumWordCounc, 
AllWordCount) 

Model Exp PornTextWordRatioLite RATIO(PornLiteWordCount, AllWordCount) 
5 Model Exp PornTextWordRatioExtraLite RATIO(PornExtraLiteWordCount, 
AllWordCount) 

# 

#Body Text Unique Word Ratio Models 
Model Exp PornTextWordRatioExtraHardUnique 
10 RATIO(PornExtraHardWordCountUnique, AllWordCountUnique) 
Model Exp PornTextWordRatioHardUnique 
RATIO(PornHardWordCountUnique, AllWordCountUnique) 

Model Exp PomTextWordRatioMedium Unique 
1 5 RATIO(PornMediumWordCountUnique, AllWordCountUnique) 

Model Exp PornTextWordRatioLiteUnique RATIO(PornLiteWordCountUnique, 
AllWordCountUnique) 

Model Exp PornTextWordRatioExtraLiteUnique 
RATIO(PornExtraLiteWordCountUnique, AllWordCountUnique) 

20 # 

#Domain Word Count Models 

SubModel Param PornExtraHardDomainWordCount Context DOMAIN-NAME 
SubModel Param PornExtraHardDomainWordCount WordFile 
models/dictionary/porn/porn_words_extrahard.txt 
25 SubModel Param PornHardDomainWordCount Context DOMAIN-NAME 
SubModel Param PornHardDomainWordCount WordFile 
models/dictionary/porn/porn_words_hard.txt 

SubModel Param PornMediumDomainWordCount Context DOMAIN-NAME 
SubModel Param PornMediumDomainWordCount WordFile 
30 models/dictionary/porn/porn_words_medium.txt 

SubModel Param PornLiteDomainWordCount Context DOMAIN-NAME 
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SubModel Param PornLiteDomainWordCount WordFile 
models/dictionary/porn/porn_wordsJite.txt 

SubModel Param PornExtraLiteDomainWordCount Context DOMAIN-NAME 
5 SubModel Param PornExtraLiteDomainWordCount WordFile 
models/dictionary/porn/porn_words_extralite.txt 

# 

#Meta Text Word Count Models 

SubModel Param PornExtraHardMetaWordCount Context META 
10 SubModel Param PornExtraHardMetaWordCount WordFile 
models/dictionary/porn/porn_words_extrahard.txt 
SubModel Param PornHardMetaWordCount Context META 
SubModel Param PornHardMetaWordCount WordFile 
models/dictionary/porn/porn_words_hard.txt 

15 

SubModel Param PornMediurnMetaWordCount Context META 
SubModel Param PornMediurnMetaWordCount WordFile 
models/dictionary/porn/porn_words_medium.txt 
SubModel Param PomLiteMetaWordCount Context META 

20 SubModel Param PomLiteMetaWordCount WordFile 
models/dictionary/porn/porn_words_lite.txt 
SubModel Param PornExtraLiteMetaWordCount Context META 
SubModel Param PornExtraLiteMetaWordCount WordFile 
models/dictionary/porn/porn_words_extralite.txt 

25 # 

#Meta Text Word Ratio Models 

Model Exp PornMetaWordRatioExtraHard 

RATIO(PornExtraHardMetaWordCount, AllMetaWordCount) 

Model Exp PornMetaWordRatioHard RATIO(PornHardMetaWordCount, 

30 AllMetaWordCount) 
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Model Exp PornMetaWordRatioMedium RATIO(PornMediumMetaWordCount, 
AllMetaWordCount) 

Model Exp PornMetaWordRatioLite RATIO(PornLiteMetaWordCount, 
5 AllMetaWordCount) 

Model Exp PornMetaWordRatioExtraLite RATIO(PornExtraLiteMetaWordCount, 
AllMetaWordCount) 

# 

#Alternate Text Word Count Models 
10 SubModel Param PornExtraHardAlternateWordCount Context ALTERNATE" 

SubModel Param PornExtraHardAltemateWordCount WordFile 

mod e I s/d i ct i on a ry/porn/porn_word s_ext rahard . txt 

SubModel Param PornHardAlternateWordCount Context ALTERNATE 

SubModel Param PornHardAlternateWordCount WordFile 
1 5 models/dictionary/porn/porn_words_hard.txt 

SubModel Param PornMediumAlternateWordCount Context ALTERNATE 

SubModel Param PornMediumAlternateWordCount WordFile 

models/dictionary/porn/porn_words_medium.txt 

SubModel Param PornLiteAlternateWordCount Context ALTERNATE 
20 SubModel Param PornLiteAlternateWordCount WordFile 

models/dictionary/porn/porn_words_lite.txt 

SubModel Param PornExtraLiteAtternateWordCount Context ALTERNATE 
SubModel Param PornExtraLiteAlternateWordCount WordFile 
models/dictionary/porn/porn_words_extralite.txt 

25 # 

#Alternate Text Word Ratio Models 

Model Exp PornAlternateWordRatioExtraHard 

RATIO(PornExtraHardAlternateWordCount, AllAlternateWordCount) 
Model Exp PornAlternateWordRatioHard RATIO(PornHardAlternateWordCount, 
30 AllAlternateWordCount) 
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Model Exp PornAlternateWordRatioMedium 
RATIO(PornMediumAlternateWordCount, AllAlternateWordCount) 
Model Exp PornAlternateWordRatioLite RATIO(PornLiteAlternateWordCount, 
5 AllAlternateWordCount) 

Model Exp PornAlternateWordRatioExtraLite 
RATIO(PornExtraLiteAlternateWordCount, Al lAlternateWordCount) 

#• 

#Links Out Models 

10 SubModel Param PornLinkOutCount Classification PORN ^ 
SubModel Param PornLinkOutCount IncludeLoca! FALSE 
Model Exp PornLinkOutRatio RATIO(PornLinkOutCount, AllLinkOutCount) 
# 

#Logistic Models 
1 5 Model Exp PomLRConstant -3.9869 
# 

Model Exp PornLRCoefficientPornTextWordRatioExtraHard 39.7450 

Model Exp PornLRCoefficientPornTextWordRatioHard 355.0550 
20 Model Exp PornLRCoefficientPornTextWordRatioMedium -136.436 
Model Exp PornLRCoefficientPornTextWordRatioLite -63.2565 
Model Exp PornLRCoefficientPornTextWordRatioExtraLite 33.9054 
# 

Model Exp PornLRCoefficientPornTextWordRatioExtraHardUnique 111.4752 
25 Model Exp PornLRCoefficientPornTextWordRatioHardUnique -72.7005 

Model Exp PornLRCoefficientPornTextWordRatioMediumUnique 264.1902 
Model Exp PornLRCoefficientPornTextWordRatioLiteUnique 125.0743 
Model Exp PornLRCoefficientPornTextWordRatioExtraLiteUnique -16.6895 
# 

30 Model Exp PornLRCoefficientPornExtraHardDomainWordCount 0.2598 
Model Exp PornLRCoefficientPornHardDomainWordCount 2.1344 
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10 



15 



20 



25 



30 



Mode 
Mode 
Mode 
# 



Mode 
Mode 
Mode 
Mode 
Mode 
# 



Mode 
Mode 
Mode 
Mode 
Mode 
# 



Mode 
Mode 



Mode 
Mode 
Mode 
Mode 
Mode 
Mode 
Mode 
Mode 
Mode 
Mode 
Mode 



Exp PornLRCoefficientPornMediumDomainWordCount 0 
Exp PornLRCoefficientPornLiteDomainWordCount 0.0610 
Exp PornLRCoefficientPornExtraLiteDomainWordCount 0 



Exp PornLRCoefficientPomMetaWordRatioExtraHard 0 
Exp PornLRCoefficientPomMetaWordRatioHard 0 
Exp PornLRCoefficientPornMetaWordRatioMedium 0 
Exp PomLRCoefficientPornMetaWordRatioLite 0 
Exp PornLRCoefficientPornMetaWordRatioExtraLite 0 



Exp PornLRCoefficientPornAlternateWordRatioExtraHard 16.1972 

Exp PornLRCoefficientPornAlternateWordRatioHard 0 

Exp PornLRCoefficientPornAlternateWordRatioMedium 26.4186 

Exp PornLRCoefficientPornAlternateWordRatioLite 0 

Exp PornLRCoefficientPornAIternateWordRatioExtraLite 14.1615 



Exp PornLRCoefficientAlICIFPictureCount 0 
Exp PornLRCoefficientLargeGIFPictureCount 0 



Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 
Exp PornLRCoeffic 



entlconGIFPictureCount 0 
entThumbnailGIFPictureCount 0 
entLargeGIFPictureRatio 0 
entlconGIFPictureRatio 0 
entThumbnailGIFPictureRatio 0 
entHighDepthGIFPictureCount 0 
entMediumDepthGIFPictureCount 0 
entLowDepthGIFPictureCount 0 
entHighDepthGIFPictureRatio 0 
entMediumDepthGIFPictureRatio 0 
entLowDepthGIFPictureRatio 0 
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Model Exp PornLRCoefficientAllJPEGPictureCount 0 
Model Exp PornLRCoefficientLargeJPEGPictureCount 0 
5 Model Exp PornLRCoefficientlconJPEGPictureCount 0 

Model Exp PornLRCoefficientThumbnailJPEGPictureCount 0 
Model Exp PornLRCoefficientLargeJPEGPictureRatio 0 
Model Exp PornLRCoefficientlconJPEGPictureRatio 0 
Model Exp PomLRCoefficientThumbnailJPECPictureRatioO 

10 Model Exp PornLRCoefficientHighDepthJPECPictureCount 0 

Model Exp PornLRCoefficientMediumDepthjPECPictureCount 0 
Model Exp PornLRCoefficientLowDepthJPEGPictureCount 0 
Model Exp PornLRCoefficientHighDepthJPEGPictureRatio 0 
Model Exp PornLRCoefficientMediumDepthJPEGPictureRatio 0 

15 Model Exp PornLRCoefficientLowDepthJPEGPictureRatio 0 
# 

Model Exp Porn LRCoefficientPornLinkOutRatio 4.6958 
Model Exp PornLRCoefficientAVSLinkOutCount 0.3327 
Model Exp PornLRCoefficientAVSLinkOutRatio 3.6786 

20 # 

Model Exp PornLRLogOdds SUM(PomLRConstant, \ 

PRODUCT(PornLRCoefficientPornTextWordRatioExtraHard, 

PornTextWordRatioExtraHard), \ 

PRODUCT(PornLRCoefficientPornTextWordRatioHard, 

25 PornTextWordRatioHard), \ 

PRODUCKPornLRCoefficientPornTextWordRatioMedium, 

PornTextWordRatioMedium), \ 

PRO D U CT(Porn LRCoef f i ci entPornTextWord Rati oL ite, 

PornTextWordRatioLite), \ 
30 PRODUCKPornLRCoefficientPornTextWordRatioExtraLite, 

PornTextWordRatioExtraLite), \ 
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PRODUCKPomLRCoefficientPornTextWordRatioExtraHardUnique, 
PornTextWordRatioExtraHardUnique), \ 

PRODUCTfPornLRCoefficientPomTextWordRatioHardUnique, 
5 PornTextWordRatioHardUnique), \ 

PRODUCTCPornLRCoefficientPornTextWordRatioMediumUnique, 
PornTextWordRatioMediumUnique), \ 

PRODUCT(PornLRCoefficientPornTextWordRatioLiteUnique, 

PornTextWordRatioLiteUnique), \ 
10 PRODUCT(PornLRCoefficientPornTextWordRatioExtraLiteUnique, 

PornTextWordRatioExtraLiteUnique), \ 

PRODUCT(PornLRCoefficientPornExtraHardDomainWordCount, 
PornExtraHardDomainWordCount), \ 

PRODUCTCPornLRCoefficientPornHardDbmainWordCount, 
1 5 PornHardDomainWordCount), \ 

PRODUCT(PornLRCoefficientPomMediumDomainWordCount / 
PornMedium Domain WordCount), \ 

PRODUCT(PornLRCoefficientPornLiteDomainWordCount / 

PornLiteDomainWordCount), \ 
20 PRODUCTCPornLRCoefficientPomExtraLiteDomainWordCount, 

PornExtraLiteDomainWordCount), \ 

PRODUCr(PornLRCoefficientPornMetaWordRatioExtraHard, 
PornMetaWordRatioExtraHard), \ 

PRODUCTCPornLRCoefficientPornMetaWordRatioHard, 
25 PornMetaWord Ratio Hard), \ 

P RO D UCT(Porn LRCoeff i cientPornMetaWord RatioMed i u m, 
PornMetaWordRatioMedium), \ 

PRODUCT(PornLRCoefficientPornMetaWordRatioLite, 
PornMetaWordRatioLite), \ 
30 PRODUCT(PornLRCoefficientPornMetaWordRatioExtraLite, 
PornMetaWordRatioExtraLite), \ 
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PRODUCTfPornLRCoefficientPornAlternateWordRatioExtraHard, 

PornAlternateWordRatioExtraHard), \ 

PRODUCTCPornLRCoefficientPornAlternateWordRatioHard, 

5 PornAlternateWordRatioHard), \ 

PRODUCT(PornLRCoefficientPornAlternateWordRatioMedium, 

PornAlternateWordRatioMedium), \ 

PRODUCT(PomLRCoefficientPornAlternateWordRatioLite, 

PornAlternateWordRatioLite), \ 
1 0 PRODUCT(PornLRCoefficientPornAlternateWordRatioExtraLite, 

Porn A item at e Word Rat i oExtra Lite), \ 

PRODUCKPornLRCoefficientAllGIFPictureCount, AllGIFPictureCount), \ 
PRODUCT(PornLRCoefficientLargeG!FPictureCount, 

LargeGIFPictureCount), \ 
15 PRODUCT(PornLRCoefficientlconGIFPictureCount, 

IconGIFPictureCount), \ 

PRODUCKPornLRCoefficientThumbnailGIFPictureCount, 

ThumbnailGIFPictureCount), \ 

PRODUCT(PomLRCoefficientLargeGIFPictureRatio, 

20 LargeGIFPictureRatio), \ 

PRODUCT(PornLRCoefficientlconGIFPictureRatio, IconGIFPictureRatio), \ 
PRODUCT(PornLRCoefficientThumbnailGIFPictureRatio / 

ThumbnailGIFPictureRatio), \ 

PRODUCKPornLRCoefficientHighDepthGIFPictureCount, 

25 HighDepthGIFPictureCount), \ 

PRODUCTCPornLRCoefficientMediumDepthGIFPictureCount, 

MediumDepthGIFPictureCount), \ 

PRODUCTlPornLRCoefficientLowDepthGIFPictureCount, 

LowDepthGIFPictureCount), \ 
30 PRODUCTCPomLRCoefficientHighDepthGIFPictureRatio, 

HighDepthGiFPictureRatio), \ 

Substitute Sheet (Rule 26) RO/AU 



ENSDOC1D: <WO. 



.005259BA1J_> 



* 



WO 00/52598 PCT/A U00/00 1 58 



26 



PRODUCT(PornLRCoefficientMediumDepthGIFPictureRatio, 

MediumDepthGIFPictureRatio), \ 

PRODUCKPornLRCoefficientLowDepthCIFPictureRatio, 

5 LowDepthGIFPictureRatio), \ 

PRODUCT(PornLRCoefficientAIIJPEGPictureCount, AllJPEGPictureCount), 

\ 

PRODUCT(PornLRCoefficientLargeJPEGPictureCount, 

LargeJPEGPictureCount), \ 
10 PRODUCT(PomLRCoefficientlconJPEGPictureCount, 

IconJPEGPictureCount), \ 

PRODUCKPornLRCoefficientThumbnailJPEGPictureCount, 

ThumbnailJPEGPictureCount), \ 

PRODUCT(PomLRCoefficientLargeJPEGPictureRatio, 

15 LargeJPEGPictureRatio), \ 

PRODUCT(PornLRCoefficientlconJPECPictureRatio, 

IconJPEGPictureRatio), \ 

PRODUCT(PornLRCoefficientThumbnailJPEGPictureP.atio, 

ThumbnailJPEGPictureRatio), \ 
20 PRODUCT(PornLRCoefficientHighDepthJPEGPictureCount, 

HighDepthJPEGPictureCount), \ 

PRODUCKPornLRCoefficientMediumDepthJPEGPictureCount, 

MediumDepthJPEGPictureCount), \ 

PRODUCTCPornLRCoefficientLowDepthJPEGPictureCount, 

25 LowDepthJPEGPictureCount), \ 

PRODUCT(PornLRCoefficientHighDepthJPEGPictureRatio, 

HighDepthJPEGPictureRatio), \ 

PRODUCKPornLRCoefficientMediumDepthJPEGPictureRatio, 

MediumDepthJPEGPictureRatio), \ 
30 PRODUCT(PornLRCoefficientLowDepthJPEGPictureRatio, 

LowDepthJPEGPictureRatio), \ 
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PRODUCT(PornLRCoefficientPornLinkOutRatio, PornLinkOutRatio), \ 
PRODUCT(PornLRCoefficientAVSLinkOutCount, AVSLinkOutCount), \ 
PRODUCT(PornLRCoefficientAVSLinkOutRatio, AVSLinkOutRatio)) 

# 

#Probability Analysers 

ProbabilityAnalyser Param PornAltMetaWordCountProbability Classification 
PORN 

ProbabilityAnalyser Exp PornAltMetaWordCountProbability \ 

SUM(PornExtraHardMetaWordCount, PornHardMetaWordCount, \ 
PRODUCKO.S.PornMediumMetaWordCount), \ 

• — 

PornExtraHardAlternateWordCount, 
PornHardAlternateWordCount, \ 

PRODUCKO.S^ornMediumAlternateWordCount)) 
ProbabilityAnalyser Param PornMetaWordRatioProbabi I ity Classification PORN 
ProbabilityAnalyser Exp PornMetaWordRatioProbabi I ity \ 

PRODUCT(100, SUM(PornMetaWordRatioExtraHard, \ 

PornMetaWordRatioHard, PornMetaWordRatioMedium)) 

ProbabilityAnalyser Param PornLRProbability Classification PORN 

ProbabilityAnalyser Exp PornLRProbability PRODUCT(100, 

RATIOd^SUMd^XPCMINUSfPornLRLogOdds))))) 

# 
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Site 2 Objects - Content Type "News" 
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165.69.1.187 


80 


42 


165.69.1.187 


80 



Destination 



IP Address 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
165.69.1.187 
165.69.1.187 
165.69.1.187 - 
165.69.1.187 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 



Port_ 

80 
63450 
80 
80 
80 
80 
80 
80 
63450 
63450 
63450 
63450 
63450 
63450 
80 
63450 
80 
63450 
80 
80 
63450 
63450 
80 
63450 
80 
63450 
80 
63450 
80 
80 
63450 
63450 
80 
63450 
80 
80 
80 
80 
63450 
63450 
63450 
63450 



Tj 
GET 
text/html 
GET 
GET 



GET 

GET 

GET 

text/html 

text/html 

text/html 

text/htm! 

text/html 

text/htm! 

GET 

image/gif 

GET 

image/gif 

GET 

GET 

image/gif 

image/gif 

GET 

image/gif 

GET 

image/gif 

GET 

image/gif 

GET 

GET 

unknown 

image/gif 

GET 

image/gif 

GET 

GET 

GET 

GET 

image/gif 
image/gif 
image/gif 
image/gif 



Size 





Timestam 


JO 


14 "54 "04 


c*0 I 


14*^4-04 






EC 

DO 




A C 

4b 




A O 

4o 




A ~~7 

47 




4/ 


1 4.34-v/"j 


333 


^ /t -CA -HA 
l4.D4.UO 


Db 


1 ** .Oh.UO 


O A A C 

2445 






14.D4.UD 


202 


1 4-D4.\Jo 


56 


1 4.D4.UO 


71 


1 4.D4.UD 






04 






1 4* c v4"OB 


f>4 


14*54*06 


64 


14-54:07 


43 


14:54:07 


43 


14:54:07 


67 


14:54:07 


2442 


14:54:07 


73 


14:54:07 


1364 


14:54:07 


71 


14:54:07 


8942 


14:54:07 


71 


14:54:07 


65 


14:54:08 


10 


14:54:08 


15550 


14:54:08 


77 


14:54:08 


4732 


14:54:08 


70 


14:54:08 


70 


14:54:09 


68 


14:54:09 


68 


14:54:09 


436 


14:54:09 


405 


14:54:09 


436 


14:54:09 


405 


14:54:09 



FIG. 3 



ENSDOCID: <WO_ 



„00S259eA1_L> 



WO 00/52598 



PCT/AU00/00I58 
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Rite 2 Objects - Content Type "News" 



Object 





IP Address 


Port 




AAA 4 Oft 4 /4 C 

202.1 oy. 1 o. AO 


63450 


2 


165.69.1 .187 




3 


mAA -ri A/™\ 4 A j4 C 

202.139.16.45 




4 


202.139.16.45 




5 


202.139.16.45 




6 


202.139.16.45 


A a ^4 cA 


7 


202.139.16.45 


Do40u 


8 


202.139.16.45 




9 


165.b9.1.187 




10 


165.69.1.187 


on 
OU 


11 


165.69.1.187 


OU 


12 


165.69.1.187 


OU 


13 


165.69.1.187 


on 
OU 


14 


165.69.1.187 


AH 


15 


AAA •* Aft <4 A ilC 

202.139.16.45 


c'Mcn 
oo^^u 


16 


165.69.1 .187 


ou 


17 


AAA -4 no O ylC 

202.139. 16. 4b 




^ a 

18 


165.69.1 .10/ 


80 


a. 
19 


2U2. 1 03. 1 o.**o 


63450 


OA 
<iU 




63450 




1 a^ fiQ 1 1 R7 


80 




165 69 1 1 87 


80 


23 


202 139.16.45 


63450 


24 


165.69.1 .187 


80 


25 


202.139.16.45 


63450 


26 


165.69.1.187 


80 


27 


202.139.16.45 


63450 


28 


165.69.1.187 


80 


29 


202.139.16.45 


63450 


30 


202.139.16.45 


63450 


•31 


165.69.1.187 


80 


32 


165.69.1.187 


80 


33 


202.139.16.45 


63450 


34 


165.69.1.187 


80 


35 


202.139.16.45 


63450 


36 


202.139.16.45 


63450 


37 


202.139.16.45 


63450 


38 


202.139.16.45 


63450 


39 


165.69.1.187 


80 


40 


165.69.1.187 


80 


41 


165.69.1.187 


80 


42 


165.69.1.187 


80 



IP Address 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
165.69.1.187 
165.69.1-187 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139-16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139.16.45 
165.69.1.187 
202.139.16.45 
165.69.1.187 
165.69.1.187 
165.69.1.187 
165.69.1.187 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 



Port_ 

80 
63450 
80 
80 
80 
80 
80 
80 
63450 
63450 
63450 
63450 
63450 
63450 
80 
63450 
80 
63450 
80 
80 
63450 
63450 
80 
63450 
80 
63450 
80 
63450 
80 
80 
63450 
63450 
80 
63450 
80 
80 
80 
80 
63450 
63450 
63450 
63450 



Type 
GET 
text/html 



GET 

GET 

GET 

GET 

GET 

text/html 

texl/htmi 

text/html 

text/html 

text/html 

text/html 

GET 

image/gif 

GET 

image/gif 

GET 

GET 

image/gif 
image/gif 
GET 
image/gif 
GET 
image/gif 
GET 
image/gif 
GET 
GET 
unknown 
image/gif 
GET 
mage/gtf 
GET 
GET 
GET 
GET 
image/gif 
image/gif 
image/gif 
image/gif 



Size 



38 
2312 
52 
56 
46 
48 
47 
47 
333 
56 
2445 
202 
202 
56 
71 
1229 
64 
43 
64 
64 
43 
43 
67 
2442 
73 
1364 
71 
8942 
71 
65 
10 
15550 
77 
4732 
70 
70 
68 
68 
436 
405 
436 
405 



Timestam 
4:54:04 
4:54:04 
4:54:05 
4:54:05 
4:54:05 
4:54:05 
4:54:05 
4:54:05 
4:54:05 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:06 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:07 
4:54:08 
4:54:08 
4:54:08 
4:54:08 
4:54:08 
4:54:08 
4:54:09 
4:54:09 
4:54:09 
4:54:09 
4:54:09 
4:54:09 
4:54:09 




FIG. 3 



aMSOOCJD: <WO. 



.C0525S3A1 I > 



WO 00/52598 



PCT/AU00/O0158 
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Site 3 Objects - Content Type "Entertainment" 



[Object 



Source 



IP Address 



Port 



Destination 



IP Address 



Port 



Type 



Size 



[Timestarnp 



1 202.139.16.45 

2 204.202. 129.23| 

3 202.139.16.45 

4 204.202.129.23) 

5 202.139.16.45 

6 202.139.16.45 

7 202.139.16.45 

8 204.202.129.2! 

9 204.202.1 29.23! 

10 204.202.129.231 

11 202.139.16.45 

12 202.139.16.45 

13 202.139.16.45 

14 204.202.129.231 

15 '204.202.129.23 

16 204.202.129.23| 

17 202.139.16.45 

18 202.139.16.45 

19 204.202.129.231 

20 204.202.129.23] 

21 202.139.16.45 

22 202.139.16.45 

23 202.139.16.45 

24 202.139.16.45 

25 204.202. 129.23] 

26 204.202.129.23 

27 204.202.129.23 

28 204.202.129.23j 

29 202.139.16.45 

30 202.139.16.45 

31 202.139.16.45 

32 204.202.129.231 

33 204.202.129.23 

34 204.202.129.231 

35 202.139.16.45 

36 204.202.129.23) 

37 202.139.16.45 

38 202.139.16.45 

39 202.139.16.45 

40 202.139.16.45 

41 204.202. 129.231 



42 
43 
44 
45 
46 
47 
48 



204.202.129.23 
204.202.129.23) 
202.139.16.45 
204.202.129.23) 
202.139.16.45 
204.202.129.23] 
204.202.129.23 



63450 204.202. 1 29.23 80 

80 202.139.16.45 63450 

63450 204.202. 1 29.23 80 

80 202.139.16.45 63450 

63450 204.202.129.23 80 

63450 204.202.129.23] 80 

63450 204.202. 1 29.23 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

63450 204.202.129.23] 80 

63450 204.202. 1 29 .23 80 

63450 204.202. 1 29.23 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

63450 204.202.129.23 80 

63450 204 .202. 1 29.23 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

63450 204.202. 1 29.23 80 

63450 204.202. 1 29.23] 80 

63450 204.202. 1 29.23] 80 

63450 204.202.129.23 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

63450 204.202. 1 29.23 80 

63450 204.202.129.23] 80 

63450 204.202.1 29.23] 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

63450 204.202.129.23 80 

80 202.139.16.45 63450 

63450 204.202.129.23 80 

63450 204.202.129.23 80 

63450 204.202. 1 29.23 80 

63450 204.202.129.23 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

80 202.139.16.45 63450 

63450 204 .202. 1 29.23 80 

80 202.139.16.45 63450 

63450 204.202.1 29.23 80 

80 202.139.16.45 63450 

80 202.139.16.45 63450 



GET 
text/html 
GET 
text/html 
GET 
GET 
GET 

image/jpec 
image/gif 
image/gif 
GET 
GET 
GET 
mage/gif 
image/gif 
image/gif 
GET 
GET 
image/gif 
image/gif 
GET 
GET 
GET 
GET 
image/gif 
image/gif 
image/gif 
image/gif 
GET 
GET 
GET 

image/jpec 
image/gif 
image/gif 
GET 
image/gif 
GET 
GET 
GET 
GET 
mage/gif 
mage/gif 
mage/gif 
GET 

mage/jpec 
GET 
image/gif 
mage/gif 



40 14:55:10 
56 14:55:11 
39 14:55:11 
25765 14:55:12 
58 14:55:13 
66 14:55:13 

62 14:55:13 
5277 14:55:14 

43 14:55:14 
1266 14:55:14 

63 14:55:20 
58 14:55:21 
72 14:55:21 

1733 14:55:22 
5314 14:55:22 
414 14:55:22 
68 14:55:22 

62 14:55:22 
406 14:55:22 
746 14:55:22 

65 14:55:23 
58 14:55:23 

63 14:55:23 
62 14:55:23 

1665 14:55:23 
35 14:55:24 
906 14:55:24 
447 14:55:24 
67 14:55:24 
58 14:55:24 
62 14:55:24 
7861 14:55:24 
391 14:55:25 
641 14:55:25 
57 14:55:25 
377 14:55:25 
60 14:55:26 
78 14:55:26 
74 14:55:26 
67 14:55:26 
403 14:55:26 
1796 14:55:26 
6845 14:55:27 
56 14:55:27 
17796 14:55:27 
56 14:55:27 
49 14:55:27 
44 14:55:27 



FIG. 4 



EVSDOCID: <WO. 



0052593A1 J_> 



WO 00/52598 



PCT/AUOO/00158 
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Site 4 Objects - Content Type "Computer Hardware 



Obfect 


ID AHHrtoc'C 


Port 


1 


20^. i jy. l o.**o 


63450 

W w » w w 


2 


OOO 4 >l O *^ Af\ C. 

209.1 4o.^*HJ.o 


BO 


a 

3 


202. 1 J9. lo.«*o 


63450 

Ow^r WW 


4 


AAA 4 yf O O/tn £ 

209. 14o.^4U.b 


80 


5 


202.139.1 b.4o 




6 


ooo 4 on h c >i c 

202.1 39. lb. 4o 




7 


OOO A on < C /IC 

202.1 39. 1 


DJHJU 


8 


209.1 43.240. o 


AO 


9 


A.AO A OO 4 O A C 

202.139.16.45 




10 


209.143.240.6 


on 
ou 


11 


209.143.240.6 




12 


*^.AA w OO 4 & AC 

202.139.16.45 


A34^n 


M o 

13 


ooo h on -i c A C 

202.1 o9.i b.^o 


634 SO 


14 


209. 1 4o.^*hJ. o 


ao 


15 


ooo •{ QQ -\ C ACL 


63450 

WW » W w 


l D 




63450 


1 / 


ooq i^.*^ p/in 6 


80 


1 A 


PHQ 143 240 6 


80 


1 Q 


pn9 139 1 6 45 


63450 


20 


202 139.16.45 


63450 


21 


209.143.240.6 


80 


22 


202.139.16.45 


63450 


23 


209.143.240.6 


80 


24 


209.143.240.6 


80 


25 


209.143.240.6 


80 


26 


202.139.16.45 


63450 


27 


209.143.240.6 


80 


28 


202.139.16.45 


63450 


29 


202.139.16.45 


63450 


30 


202.139.16.45 


63450 


31 


209.143.240.6 


80 


32 


202.139.16.45 


63450 


33 


209.143.240.6 


80 


34 


209.143.240.6 


80 


35 


202.139.16.45 


63450 


36 


209.143.240.6 


80 


37 


202.139.16.45 


63450 


38 


209.143.240.6 


80 


39 


202.139.16.45 


63450 


40 


209.143.240.6 


80 


41 


202.139.16.45 


63450 


42 


209.143.240.6 


80 



IP Address 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 
209.143.240.6 
209.143.240.6 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 
202.139.16.45 
209.143.240.6 
209.143.240.6 
202.139.16.45 
209.143.240.6 
209.143.240.6 
202.139.16.45 
202.139.16.45 
209.143.240.6 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 
202.139.16.45 
202.139.16.45 
209.143.240.6 
202.139.16.45 
209.143.240.6 
209.143.240.6 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 
202.139.16.45 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 
209.143.240.6 
202.139.16.45 



Port 

80 
63450 
80 
63450 
80 
80 
80 
63450 
80 
63450 
63450 
80 
80 
63450 
80 
80 
63450 
63450 
80 
80 
63450 
80 
63450 
63450 
63450 
80 
63450 
80 
80 
80 
63450 
80 
63450 
63450 
80 
63450 
80 
63450 
80 
63450 
80 
63450 



Type 
GET 
text/html 
GET 
text/html 
GET 
GET 
GET 
image/gif 
GET 

image/gif 

image/gif 

GET 

GET 

image/gif 

GET 

GET 

image/gif 

image/gif 

GET 

GET 

image/gif 

GET 

image/gif 

image/gif 

image/gif 

GET 

image/gif 

GET 

GET 

GET 

image/gif 

GET 

image/gif 

image/gif 

GET 

image/gif 

GET 

image/gif 

GET • 

image/gif 

GET 

image/gif 



Size 



34 
1645 
47 
56 
64 
63 
63 
290 
64 
403 
381 
63 
65 
348 
65 
65 
354 
600 
65 
54 
490 
53 
571 
322 
571 
55 
363 
63 
63 
62 
241 
59 
488 
463 
59 
714 
53 
35 
53 
1188 
54 
327 



Timestam i 
4:55:49 
4:55:50 
4:55:51 
4:55:51 
4:55:53 
4:55:53 
4:55:53 
4:55:53 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:54 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:55 
4:55:56 
4:55:56 
4:55:56 
4:55:56 
4:55:56 
4:55:56 
4:55:56 
4:55:56 
4:55:56 
4:55:55 
4:55:56 
4:55:57 



FIG. 5 



ENSDOCID- <WO. 



C0525SaAl_t_> 



WO 00/52598 



PCT/AU00/00158 
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Object 



Conte nt Type "C omi 

Destination 



uter Software 






IP Address 


Port 


1 

I 


202 139 16.45 


63450 


o 

c. 


907 4.6 1 30 1 49 


80 


Q 

o 


OCiO 1 39 1 6 45 


63450 


A 


907 46 1 30 1 49 


80 


c 
D 


ono 1 1Q 1 6 45 


63450 


© 


0CY7 46 130 149 


80 


f 


ono 1 iq 1 6 45 


63450 


0 
O 


on7 Aft 1 in 1 4.9 


80 


9 


OAO i QQ 1 ft A^ 


63450 


10 




63450 


1 1 




63450 


12 




63450 


•4 0 
10 


on"7 Aft 1 in 1 49 


80 


.1 ** 


1 1Q 1 6 45 


63450 


ID 


on7 Aft 1 10 1 49 

£\J i .*frO. 1 WW. 1 **W 


80 


1 O 


on*7 Aft 1 30 1 49 


80 


1 / 


OHT Aft 1 10 1 49 

£\J f .*fO. I WW. 1 "TW 


80 


18 


202.139.16.45 


63450 


19 


207.46.130.149 


80 


20 


207.46.130.149 


80 


21 


202.139.16.45 


63450 


22 


207.46.130.149 


80 


23 


202.1 39. 1 6. 45 


63450 


24 


202.139.16.45 


63450 


25 


202.139.16.45 


63450 


26 


202.139.16.45 


63450 


27 


207.46.130.149 


80 


28 


207.46.130.149 


80 


29 


207.46.130.149 


80 


30 


207.46.130.149 


80 


31 


202.139.16.45 


63450 


32 


207.46.130.149 


80 



IP Address 
207.46.130.149 
202.139.16.45 
207.46.130.149 
202.139.16.45 
207.46.130.149 
202.139.16.45 
207.46.130.149 
202.139.16.45 
207.46.130.149 
207.46.130.149 
207.46.130.149 
207.46.130.149 
202.139.16.45 
207.46.130.149 
202.139.16.45 
202.139.16.45 
202.139.16.45 
207.46.130.149 
202.139.16.45 
202.139.16.45 
207.46.130.149 
202.139.16.45 
207.46.130.149 
207.46.130.149 
207.46.130.149 
207.46.130.149 
202.139.16.45 
202.139.16.45 
202.139.16.45 
202.139.16.45 
207.46.130.149 
202.139.16.45 



Port_ 

80 
63450 
80 
63450 
80 
63450 
80 
63450 
80 
80 
80 
80 
63450 
80 
63450 
63450 
63450 
80 
63450 
63450 
80 
63450 
80 
80 
80 
80 
63450 
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